Dataset statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Number of variables | 7 | 7 |
| Number of observations | 22620 | 20162 |
| Missing cells | 2458 | 0 |
| Missing cells (%) | 1.6% | 0.0% |
| Duplicate rows | 449 | 0 |
| Duplicate rows (%) | 2.0% | 0.0% |
| Total size in memory | 1.4 MiB | 551.4 KiB |
| Average record size in memory | 64.0 B | 28.0 B |
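The percentage rows above follow from the counts by simple division. The sketch below is a minimal check, assuming missing cells are measured against observations × variables and duplicate rows against observations; the helper name `pct` is illustrative and not part of the report.

```python
# Counts copied from the Original Data column of the table above.
n_variables = 7
n_observations = 22620
missing_cells = 2458
duplicate_rows = 449


def pct(part: float, whole: float) -> float:
    """Return part/whole as a percentage (hypothetical helper)."""
    return 100.0 * part / whole


# Assumption: missing cells are counted against every cell in the table.
missing_cells_pct = pct(missing_cells, n_observations * n_variables)
# Assumption: duplicate rows are counted against the number of rows.
duplicate_rows_pct = pct(duplicate_rows, n_observations)

print(f"Missing cells (%): {missing_cells_pct:.1f}")
print(f"Duplicate rows (%): {duplicate_rows_pct:.1f}")
```

Under these assumptions both values round to the 1.6% and 2.0% shown in the table.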
Variable types
| | Original Data | Synthetic Data |
|---|---|---|
| Numeric | 7 | 7 |
Alerts
| Original Data | Synthetic Data | |
|---|---|---|
| Dataset has 449 (2.0%) duplicate rows | Alert not present in | Duplicates |
| mis_and_disinformation is highly overall correlated with mis_and_disinformation_male and 5 other fields | mis_and_disinformation is highly overall correlated with mis_and_disinformation_male and 4 other fields | High Correlation |
| mis_and_disinformation_male is highly overall correlated with mis_and_disinformation and 5 other fields | mis_and_disinformation_male is highly overall correlated with mis_and_disinformation and 4 other fields | High Correlation |
| mis_and_disinformation_female is highly overall correlated with mis_and_disinformation and 5 other fields | mis_and_disinformation_female is highly overall correlated with mis_and_disinformation and 5 other fields | High Correlation |
| myths is highly overall correlated with mis_and_disinformation and 5 other fields | myths is highly overall correlated with mis_and_disinformation and 4 other fields | High Correlation |
| myths_female is highly overall correlated with mis_and_disinformation and 5 other fields | myths_female is highly overall correlated with mis_and_disinformation and 4 other fields | High Correlation |
| myths_male is highly overall correlated with mis_and_disinformation and 5 other fields | myths_male is highly overall correlated with mis_and_disinformation and 4 other fields | High Correlation |
| new_vaccinations_smoothed is highly overall correlated with mis_and_disinformation and 5 other fields | new_vaccinations_smoothed is highly overall correlated with mis_and_disinformation_female | High Correlation |
| new_vaccinations_smoothed has 2458 (10.9%) missing values | Alert not present in | Missing |
| mis_and_disinformation has 3530 (15.6%) zeros | Alert not present in | Zeros |
| mis_and_disinformation_male has 6983 (30.9%) zeros | Alert not present in | Zeros |
| mis_and_disinformation_female has 8940 (39.5%) zeros | Alert not present in | Zeros |
| myths has 6231 (27.5%) zeros | Alert not present in | Zeros |
| myths_female has 11337 (50.1%) zeros | Alert not present in | Zeros |
| myths_male has 9890 (43.7%) zeros | Alert not present in | Zeros |
| new_vaccinations_smoothed has 304 (1.3%) zeros | Alert not present in | Zeros |
| Alert not present in | mis_and_disinformation_female is highly skewed (γ1 = 33.46757126) | Skewed |
| Alert not present in | myths is highly skewed (γ1 = 87.50576782) | Skewed |
| Alert not present in | new_vaccinations_smoothed is highly skewed (γ1 = 29.18977356) | Skewed |
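The Skewed alerts are consistent with a fixed cutoff applied to the skewness values listed in the per-variable sections. The sketch below assumes a cutoff of 20; that number is an assumption chosen because it separates the flagged from the unflagged synthetic columns here, and the profiler's actual setting is not stated in this report.

```python
# Skewness of each Synthetic Data column, copied from the per-variable sections.
synthetic_skewness = {
    "mis_and_disinformation": 5.928329,
    "mis_and_disinformation_male": 7.911963,
    "mis_and_disinformation_female": 33.467571,
    "myths": 87.505768,
    "myths_female": 11.610792,
    "myths_male": 13.915636,
    "new_vaccinations_smoothed": 29.189774,
}

SKEWNESS_CUTOFF = 20.0  # assumption, not taken from the report

# Columns whose absolute skewness exceeds the assumed cutoff.
highly_skewed = sorted(
    name for name, g1 in synthetic_skewness.items() if abs(g1) > SKEWNESS_CUTOFF
)
print(highly_skewed)
```

With this cutoff the flagged columns are the same three that carry a Skewed alert for the synthetic data.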
Reproduction
| | Original Data | Synthetic Data |
|---|---|---|
| Analysis started | 2023-01-21 05:54:14.715790 | 2023-01-21 05:54:24.229833 |
| Analysis finished | 2023-01-21 05:54:24.203088 | 2023-01-21 05:54:33.758261 |
| Duration | 9.49 seconds | 9.53 seconds |
| Software version | pandas-profiling v3.6.2 | pandas-profiling v3.6.2 |
| Download configuration | config.json | config.json |
mis_and_disinformation
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 1503 | 20137 |
| Distinct (%) | 6.6% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 132.18904 | 194.96919 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.25749367 |
| Maximum | 9342 | 8974.4248 |
| Zeros | 3530 | 0 |
| Zeros (%) | 15.6% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.25749367 |
| 5-th percentile | 0 | 0.51735753 |
| Q1 | 2 | 1.1794579 |
| median | 8 | 4.1513669 |
| Q3 | 48 | 34.948195 |
| 95-th percentile | 720.05 | 1086.7275 |
| Maximum | 9342 | 8974.4248 |
| Range | 9342 | 8974.1673 |
| Interquartile range (IQR) | 46 | 33.768738 |
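Range and interquartile range are derived from the other rows: range = maximum − minimum and IQR = Q3 − Q1. The short check below uses the rounded Synthetic Data values printed above, so agreement is only to the displayed precision.

```python
# Synthetic Data quantiles for mis_and_disinformation, as printed in the table.
minimum, q1, q3, maximum = 0.25749367, 1.1794579, 34.948195, 8974.4248

value_range = maximum - minimum  # range = maximum - minimum
iqr = q3 - q1                    # interquartile range = Q3 - Q1

print(f"Range: {value_range:.4f}")
print(f"IQR:   {iqr:.5f}")
```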
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 463.62593 | 714.19836 |
| Coefficient of variation (CV) | 3.5072949 | 3.6631345 |
| Kurtosis | 76.70032 | 41.696888 |
| Mean | 132.18904 | 194.96919 |
| Median Absolute Deviation (MAD) | 8 | 3.5383379 |
| Skewness | 7.4936843 | 5.928329 |
| Sum | 2990116 | 3930968.8 |
| Variance | 214949.01 | 510079.34 |
| Monotonicity | Not monotonic | Not monotonic |
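Two of the rows above are functions of the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below recomputes both from the rounded table values; small differences in the last digits are expected because the inputs are already rounded.

```python
# Standard deviation and mean for mis_and_disinformation, as printed above.
stats = {
    "Original Data": {"std": 463.62593, "mean": 132.18904},
    "Synthetic Data": {"std": 714.19836, "mean": 194.96919},
}

derived = {}
for label, s in stats.items():
    derived[label] = {
        "cv": s["std"] / s["mean"],   # coefficient of variation
        "variance": s["std"] ** 2,    # variance = std squared
    }

for label, d in derived.items():
    print(f"{label}: CV = {d['cv']:.5f}, variance = {d['variance']:.1f}")
```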
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3530 | 15.6% |
| 1 | 2071 | 9.2% |
| 2 | 1504 | 6.6% |
| 3 | 1110 | 4.9% |
| 4 | 958 | 4.2% |
| 5 | 716 | 3.2% |
| 6 | 613 | 2.7% |
| 7 | 597 | 2.6% |
| 8 | 453 | 2.0% |
| 9 | 412 | 1.8% |
| Other values (1493) | 10656 |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 3.027581692 | 2 | < 0.1% |
| 1.549239397 | 2 | < 0.1% |
| 2.719955444 | 2 | < 0.1% |
| 2.921707392 | 2 | < 0.1% |
| 1.819850683 | 2 | < 0.1% |
| 0.907022059 | 2 | < 0.1% |
| 0.4632177353 | 2 | < 0.1% |
| 0.5650966167 | 2 | < 0.1% |
| 0.7491480708 | 2 | < 0.1% |
| 0.6387518644 | 2 | < 0.1% |
| Other values (20127) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3530 | |
| 1 | 2071 | |
| 2 | 1504 | |
| 3 | 1110 | 4.9% |
| 4 | 958 | 4.2% |
| 5 | 716 | 3.2% |
| 6 | 613 | 2.7% |
| 7 | 597 | 2.6% |
| 8 | 453 | 2.0% |
| 9 | 412 | 1.8% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.2574936748 | 1 | |
| 0.2624304593 | 1 | |
| 0.2693678439 | 1 | |
| 0.2722857893 | 1 | |
| 0.2735600471 | 1 | |
| 0.285967499 | 1 | |
| 0.2899042368 | 1 | |
| 0.2900577188 | 1 | |
| 0.2907913625 | 1 | |
| 0.2923562825 | 1 |
mis_and_disinformation_male
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 912 | 20145 |
| Distinct (%) | 4.0% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 51.710477 | 45.556214 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0017139363 |
| Maximum | 3351 | 3319.9016 |
| Zeros | 6983 | 0 |
| Zeros (%) | 30.9% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0017139363 |
| 5-th percentile | 0 | 0.0040189742 |
| Q1 | 0 | 0.012929554 |
| median | 3 | 0.075563148 |
| Q3 | 17 | 1.4213123 |
| 95-th percentile | 294 | 169.28096 |
| Maximum | 3351 | 3319.9016 |
| Range | 3351 | 3319.8999 |
| Interquartile range (IQR) | 17 | 1.4083827 |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 179.22929 | 233.83656 |
| Coefficient of variation (CV) | 3.466015 | 5.1329235 |
| Kurtosis | 67.376797 | 72.5485 |
| Mean | 51.710477 | 45.556214 |
| Median Absolute Deviation (MAD) | 3 | 0.070963144 |
| Skewness | 7.0232094 | 7.911963 |
| Sum | 1169691 | 918504.39 |
| Variance | 32123.138 | 54679.535 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6983 | |
| 1 | 2532 | 11.2% |
| 2 | 1608 | 7.1% |
| 3 | 1156 | 5.1% |
| 4 | 886 | 3.9% |
| 5 | 687 | 3.0% |
| 6 | 562 | 2.5% |
| 7 | 443 | 2.0% |
| 8 | 344 | 1.5% |
| 10 | 313 | 1.4% |
| Other values (902) | 7106 |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.007497055456 | 2 | < 0.1% |
| 0.01658315025 | 2 | < 0.1% |
| 142.8185272 | 2 | < 0.1% |
| 0.009448382072 | 2 | < 0.1% |
| 0.2914951742 | 2 | < 0.1% |
| 0.0579393208 | 2 | < 0.1% |
| 0.01699636504 | 2 | < 0.1% |
| 0.06248620898 | 2 | < 0.1% |
| 0.1034393981 | 2 | < 0.1% |
| 1.121963143 | 2 | < 0.1% |
| Other values (20135) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6983 | |
| 1 | 2532 | 11.2% |
| 2 | 1608 | 7.1% |
| 3 | 1156 | 5.1% |
| 4 | 886 | 3.9% |
| 5 | 687 | 3.0% |
| 6 | 562 | 2.5% |
| 7 | 443 | 2.0% |
| 8 | 344 | 1.5% |
| 9 | 300 | 1.3% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.001713936334 | 1 | |
| 0.001778369537 | 1 | |
| 0.001795734162 | 1 | |
| 0.001853806083 | 1 | |
| 0.001881818054 | 1 | |
| 0.001883565099 | 1 | |
| 0.001904274337 | 1 | |
| 0.001975537278 | 1 | |
| 0.001990786754 | 1 | |
| 0.0020405096 | 1 |
mis_and_disinformation_female
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 717 | 20141 |
| Distinct (%) | 3.2% | 99.9% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 32.971927 | 0.63210664 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 4.8112943 × 10⁻⁶ |
| Maximum | 2745 | 375.33386 |
| Zeros | 8940 | 0 |
| Zeros (%) | 39.5% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 4.8112943 × 10⁻⁶ |
| 5-th percentile | 0 | 3.4263846 × 10⁻⁵ |
| Q1 | 0 | 0.00016232475 |
| median | 1 | 0.0010686517 |
| Q3 | 8 | 0.018942724 |
| 95-th percentile | 171 | 1.6793667 |
| Maximum | 2745 | 375.33386 |
| Range | 2745 | 375.33386 |
| Interquartile range (IQR) | 8 | 0.018780399 |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 127.97281 | 6.1599326 |
| Coefficient of variation (CV) | 3.8812657 | 9.7450845 |
| Kurtosis | 86.162925 | 1547.1344 |
| Mean | 32.971927 | 0.63210664 |
| Median Absolute Deviation (MAD) | 1 | 0.001024836 |
| Skewness | 7.979837 | 33.467571 |
| Sum | 745825 | 12744.534 |
| Variance | 16377.041 | 37.944767 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 8940 | |
| 1 | 2841 | 12.6% |
| 2 | 1752 | 7.7% |
| 3 | 1078 | 4.8% |
| 4 | 738 | 3.3% |
| 5 | 556 | 2.5% |
| 6 | 446 | 2.0% |
| 7 | 369 | 1.6% |
| 8 | 291 | 1.3% |
| 9 | 244 | 1.1% |
| Other values (707) | 5365 |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 7.466223178 × 10⁻⁵ | 2 | < 0.1% |
| 0.008932402357 | 2 | < 0.1% |
| 4.873222497 × 10⁻⁵ | 2 | < 0.1% |
| 0.0001398147579 | 2 | < 0.1% |
| 0.0003148161049 | 2 | < 0.1% |
| 0.001261505648 | 2 | < 0.1% |
| 0.7851961255 | 2 | < 0.1% |
| 0.0003220156941 | 2 | < 0.1% |
| 0.0003940643219 | 2 | < 0.1% |
| 0.0001623247517 | 2 | < 0.1% |
| Other values (20131) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 8940 | |
| 1 | 2841 | 12.6% |
| 2 | 1752 | 7.7% |
| 3 | 1078 | 4.8% |
| 4 | 738 | 3.3% |
| 5 | 556 | 2.5% |
| 6 | 446 | 2.0% |
| 7 | 369 | 1.6% |
| 8 | 291 | 1.3% |
| 9 | 244 | 1.1% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 4.811294275 × 10⁻⁶ | 1 | |
| 5.135927495 × 10⁻⁶ | 1 | |
| 5.690041235 × 10⁻⁶ | 1 | |
| 5.765156857 × 10⁻⁶ | 1 | |
| 5.809421054 × 10⁻⁶ | 1 | |
| 5.9827521 × 10⁻⁶ | 1 | |
| 6.099553048 × 10⁻⁶ | 1 | |
| 6.262137958 × 10⁻⁶ | 1 | |
| 6.278678484 × 10⁻⁶ | 1 | |
| 6.307185231 × 10⁻⁶ | 1 | |
myths
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 849 | 20093 |
| Distinct (%) | 3.8% | 99.7% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 47.737622 | 5.1294338 × 10⁻⁷ |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 5.5744087 × 10⁻¹³ |
| Maximum | 4204 | 0.0021937087 |
| Zeros | 6231 | 0 |
| Zeros (%) | 27.5% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 5.5744087 × 10⁻¹³ |
| 5-th percentile | 0 | 4.1772176 × 10⁻¹² |
| Q1 | 0 | 2.084769 × 10⁻¹¹ |
| median | 4 | 1.5116586 × 10⁻¹⁰ |
| Q3 | 24 | 2.2952317 × 10⁻⁹ |
| 95-th percentile | 228 | 1.928308 × 10⁻⁷ |
| Maximum | 4204 | 0.0021937087 |
| Range | 4204 | 0.0021937087 |
| Interquartile range (IQR) | 24 | 2.274384 × 10⁻⁹ |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 157.69701 | 1.9959067 × 10⁻⁵ |
| Coefficient of variation (CV) | 3.3034115 | 38.910857 |
| Kurtosis | 84.481684 | 8622.3672 |
| Mean | 47.737622 | 5.1294338 × 10⁻⁷ |
| Median Absolute Deviation (MAD) | 4 | 1.4584707 × 10⁻¹⁰ |
| Skewness | 7.4140392 | 87.505768 |
| Sum | 1079825 | 0.010341964 |
| Variance | 24868.347 | 3.9836431 × 10⁻¹⁰ |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6231 | |
| 1 | 2581 | 11.4% |
| 2 | 1495 | 6.6% |
| 3 | 997 | 4.4% |
| 4 | 788 | 3.5% |
| 5 | 636 | 2.8% |
| 6 | 556 | 2.5% |
| 7 | 442 | 2.0% |
| 8 | 387 | 1.7% |
| 9 | 340 | 1.5% |
| Other values (839) | 8167 |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 1.600090373 × 10⁻¹¹ | 2 | < 0.1% |
| 2.036454165 × 10⁻¹¹ | 2 | < 0.1% |
| 2.708205535 × 10⁻¹¹ | 2 | < 0.1% |
| 1.148236627 × 10⁻¹¹ | 2 | < 0.1% |
| 1.148199938 × 10⁻¹⁰ | 2 | < 0.1% |
| 3.621257461 × 10⁻¹¹ | 2 | < 0.1% |
| 4.528115383 × 10⁻¹⁰ | 2 | < 0.1% |
| 3.017615021 × 10⁻¹² | 2 | < 0.1% |
| 3.179078528 × 10⁻¹¹ | 2 | < 0.1% |
| 1.607396882 × 10⁻⁹ | 2 | < 0.1% |
| Other values (20083) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6231 | |
| 1 | 2581 | |
| 2 | 1495 | 6.6% |
| 3 | 997 | 4.4% |
| 4 | 788 | 3.5% |
| 5 | 636 | 2.8% |
| 6 | 556 | 2.5% |
| 7 | 442 | 2.0% |
| 8 | 387 | 1.7% |
| 9 | 340 | 1.5% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 5.574408665 × 10⁻¹³ | 1 | |
| 7.25037543 × 10⁻¹³ | 1 | |
| 7.888903289 × 10⁻¹³ | 1 | |
| 8.03220229 × 10⁻¹³ | 1 | |
| 8.076476228 × 10⁻¹³ | 1 | |
| 8.601234959 × 10⁻¹³ | 1 | |
| 8.87349005 × 10⁻¹³ | 1 | |
| 8.896876291 × 10⁻¹³ | 1 | |
| 9.042051504 × 10⁻¹³ | 1 | |
| 9.164562555 × 10⁻¹³ | 1 | |
myths_female
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 367 | 20128 |
| Distinct (%) | 1.6% | 99.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 11.846375 | 0.88741908 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0009409269 |
| Maximum | 1521 | 146.70242 |
| Zeros | 11337 | 0 |
| Zeros (%) | 50.1% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0009409269 |
| 5-th percentile | 0 | 0.0020053099 |
| Q1 | 0 | 0.004977763 |
| median | 0 | 0.017849388 |
| Q3 | 4 | 0.12708842 |
| 95-th percentile | 52 | 3.6867018 |
| Maximum | 1521 | 146.70242 |
| Range | 1521 | 146.70148 |
| Interquartile range (IQR) | 4 | 0.12211066 |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 44.95295 | 4.4033723 |
| Coefficient of variation (CV) | 3.7946587 | 4.9619987 |
| Kurtosis | 151.20784 | 219.98198 |
| Mean | 11.846375 | 0.88741908 |
| Median Absolute Deviation (MAD) | 0 | 0.015353912 |
| Skewness | 9.3718391 | 11.610792 |
| Sum | 267965 | 17892.143 |
| Variance | 2020.7677 | 19.389688 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11337 | |
| 1 | 2712 | 12.0% |
| 2 | 1432 | 6.3% |
| 3 | 921 | 4.1% |
| 4 | 667 | 2.9% |
| 5 | 438 | 1.9% |
| 6 | 392 | 1.7% |
| 7 | 322 | 1.4% |
| 9 | 257 | 1.1% |
| 8 | 214 | 0.9% |
| Other values (357) | 3928 | 17.4% |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.1657812148 | 2 | < 0.1% |
| 0.01287199184 | 2 | < 0.1% |
| 0.01083931886 | 2 | < 0.1% |
| 1.200712085 | 2 | < 0.1% |
| 0.01262701303 | 2 | < 0.1% |
| 0.003183210036 | 2 | < 0.1% |
| 0.002357335528 | 2 | < 0.1% |
| 0.0109860776 | 2 | < 0.1% |
| 0.01557181682 | 2 | < 0.1% |
| 0.06065730378 | 2 | < 0.1% |
| Other values (20118) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11337 | |
| 1 | 2712 | 12.0% |
| 2 | 1432 | 6.3% |
| 3 | 921 | 4.1% |
| 4 | 667 | 2.9% |
| 5 | 438 | 1.9% |
| 6 | 392 | 1.7% |
| 7 | 322 | 1.4% |
| 8 | 214 | 0.9% |
| 9 | 257 | 1.1% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0009409268969 | 1 | |
| 0.0009473963291 | 1 | |
| 0.001031626598 | 1 | |
| 0.001043590135 | 1 | |
| 0.001104090828 | 1 | |
| 0.001116379164 | 1 | |
| 0.001116638887 | 1 | |
| 0.001135243801 | 1 | |
| 0.001139126485 | 1 | |
| 0.001139322994 | 1 |
myths_male
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 456 | 20127 |
| Distinct (%) | 2.0% | 99.8% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 17.368126 | 2.4123741 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0014087727 |
| Maximum | 1365 | 523.39032 |
| Zeros | 9890 | 0 |
| Zeros (%) | 43.7% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 0.0014087727 |
| 5-th percentile | 0 | 0.0027560253 |
| Q1 | 0 | 0.0061015229 |
| median | 1 | 0.022812185 |
| Q3 | 7 | 0.23624865 |
| 95-th percentile | 88 | 10.077938 |
| Maximum | 1365 | 523.39032 |
| Range | 1365 | 523.38891 |
| Interquartile range (IQR) | 7 | 0.23014713 |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 58.291022 | 12.685512 |
| Coefficient of variation (CV) | 3.3562068 | 5.2585176 |
| Kurtosis | 64.714635 | 355.3302 |
| Mean | 17.368126 | 2.4123741 |
| Median Absolute Deviation (MAD) | 1 | 0.019612003 |
| Skewness | 6.7263113 | 13.915636 |
| Sum | 392867 | 48638.286 |
| Variance | 3397.8432 | 160.9222 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9890 | |
| 1 | 2596 | 11.5% |
| 2 | 1506 | 6.7% |
| 3 | 1018 | 4.5% |
| 4 | 746 | 3.3% |
| 5 | 639 | 2.8% |
| 6 | 435 | 1.9% |
| 7 | 402 | 1.8% |
| 8 | 308 | 1.4% |
| 9 | 307 | 1.4% |
| Other values (446) | 4773 |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.113289766 | 2 | < 0.1% |
| 0.03909193352 | 2 | < 0.1% |
| 0.002846202813 | 2 | < 0.1% |
| 0.01819017529 | 2 | < 0.1% |
| 0.1021878049 | 2 | < 0.1% |
| 0.02539433353 | 2 | < 0.1% |
| 0.007072576787 | 2 | < 0.1% |
| 0.003821014892 | 2 | < 0.1% |
| 0.07215686142 | 2 | < 0.1% |
| 0.03538630158 | 2 | < 0.1% |
| Other values (20117) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9890 | |
| 1 | 2596 | 11.5% |
| 2 | 1506 | 6.7% |
| 3 | 1018 | 4.5% |
| 4 | 746 | 3.3% |
| 5 | 639 | 2.8% |
| 6 | 435 | 1.9% |
| 7 | 402 | 1.8% |
| 8 | 308 | 1.4% |
| 9 | 307 | 1.4% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.001408772659 | 1 | |
| 0.001488958253 | 1 | |
| 0.001499351813 | 1 | |
| 0.001503663138 | 1 | |
| 0.001528530149 | 1 | |
| 0.001529715722 | 1 | |
| 0.001552285627 | 1 | |
| 0.00155452115 | 1 | |
| 0.001575703733 | 1 | |
| 0.001578219817 | 1 |
new_vaccinations_smoothed
Real number (ℝ)
| | Original Data | Synthetic Data |
|---|---|---|
| Distinct | 15978 | 20137 |
| Distinct (%) | 79.2% | 99.9% |
| Missing | 2458 | 0 |
| Missing (%) | 10.9% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 273046.67 | 2668.5122 |
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 1.3695122 |
| Maximum | 10037995 | 1247604.2 |
| Zeros | 304 | 0 |
| Zeros (%) | 1.3% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 353.4 KiB | 78.9 KiB |
Quantile statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Minimum | 0 | 1.3695122 |
| 5-th percentile | 396 | 12.471789 |
| Q1 | 6362 | 100.70676 |
| median | 41651 | 234.97049 |
| Q3 | 217664.75 | 690.17796 |
| 95-th percentile | 1266859 | 6742.1515 |
| Maximum | 10037995 | 1247604.2 |
| Range | 10037995 | 1247602.9 |
| Interquartile range (IQR) | 211302.75 | 589.4712 |
Descriptive statistics
| | Original Data | Synthetic Data |
|---|---|---|
| Standard deviation | 778429.33 | 23702.07 |
| Coefficient of variation (CV) | 2.8509021 | 8.8821293 |
| Kurtosis | 51.553296 | 1142.5132 |
| Mean | 273046.67 | 2668.5122 |
| Median Absolute Deviation (MAD) | 40250 | 179.19913 |
| Skewness | 6.4950105 | 29.189774 |
| Sum | 5.505167 × 10⁹ | 53802543 |
| Variance | 6.0595222 × 10¹¹ | 5.617881 × 10⁸ |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 304 | 1.3% |
| 1926 | 71 | 0.3% |
| 6049 | 61 | 0.3% |
| 37853 | 57 | 0.3% |
| 4431 | 52 | 0.2% |
| 2915 | 51 | 0.2% |
| 34002 | 50 | 0.2% |
| 201797 | 50 | 0.2% |
| 1401 | 45 | 0.2% |
| 23552 | 43 | 0.2% |
| Other values (15968) | 19378 | |
| (Missing) | 2458 | 10.9% |
Common values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 210.4053955 | 2 | < 0.1% |
| 797.4474487 | 2 | < 0.1% |
| 91.17314148 | 2 | < 0.1% |
| 125.5846176 | 2 | < 0.1% |
| 644.6708984 | 2 | < 0.1% |
| 681.6853027 | 2 | < 0.1% |
| 264.7999878 | 2 | < 0.1% |
| 716.2763672 | 2 | < 0.1% |
| 193.1675568 | 2 | < 0.1% |
| 343.0980835 | 2 | < 0.1% |
| Other values (20127) | 20142 |
Minimum 10 values (Original Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 304 | |
| 2 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 28 | 2 | < 0.1% |
| 33 | 2 | < 0.1% |
| 34 | 2 | < 0.1% |
| 35 | 2 | < 0.1% |
| 36 | 3 | < 0.1% |
Minimum 10 values (Synthetic Data)
| Value | Count | Frequency (%) |
|---|---|---|
| 1.3695122 | 1 | |
| 1.500377178 | 1 | |
| 1.564958572 | 1 | |
| 1.569641471 | 1 | |
| 1.609022021 | 1 | |
| 1.63750267 | 1 | |
| 1.647035837 | 1 | |
| 1.679669738 | 1 | |
| 1.689743638 | 1 | |
| 1.756750226 | 1 |
Correlations
Original Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| mis_and_disinformation | 1.000 | 0.944 | 0.911 | 0.889 | 0.824 | 0.858 | 0.571 |
| mis_and_disinformation_male | 0.944 | 1.000 | 0.869 | 0.866 | 0.812 | 0.858 | 0.544 |
| mis_and_disinformation_female | 0.911 | 0.869 | 1.000 | 0.845 | 0.814 | 0.834 | 0.530 |
| myths | 0.889 | 0.866 | 0.845 | 1.000 | 0.886 | 0.923 | 0.602 |
| myths_female | 0.824 | 0.812 | 0.814 | 0.886 | 1.000 | 0.834 | 0.549 |
| myths_male | 0.858 | 0.858 | 0.834 | 0.923 | 0.834 | 1.000 | 0.563 |
| new_vaccinations_smoothed | 0.571 | 0.544 | 0.530 | 0.602 | 0.549 | 0.563 | 1.000 |
Synthetic Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| mis_and_disinformation | 1.000 | 0.992 | 0.957 | 0.807 | 0.961 | 0.973 | -0.468 |
| mis_and_disinformation_male | 0.992 | 1.000 | 0.944 | 0.828 | 0.974 | 0.986 | -0.449 |
| mis_and_disinformation_female | 0.957 | 0.944 | 1.000 | 0.703 | 0.931 | 0.907 | -0.502 |
| myths | 0.807 | 0.828 | 0.703 | 1.000 | 0.857 | 0.862 | -0.021 |
| myths_female | 0.961 | 0.974 | 0.931 | 0.857 | 1.000 | 0.984 | -0.371 |
| myths_male | 0.973 | 0.986 | 0.907 | 0.862 | 0.984 | 1.000 | -0.406 |
| new_vaccinations_smoothed | -0.468 | -0.449 | -0.502 | -0.021 | -0.371 | -0.406 | 1.000 |
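The High Correlation alerts can be cross-checked against the Synthetic Data matrix above. The sketch below assumes an alert fires when the absolute coefficient is at least 0.5; that threshold is inferred from the tables in this report, not a documented setting.

```python
columns = [
    "mis_and_disinformation",
    "mis_and_disinformation_male",
    "mis_and_disinformation_female",
    "myths",
    "myths_female",
    "myths_male",
    "new_vaccinations_smoothed",
]

# Synthetic Data correlation matrix, copied row by row from the table above.
synthetic_corr = [
    [1.000, 0.992, 0.957, 0.807, 0.961, 0.973, -0.468],
    [0.992, 1.000, 0.944, 0.828, 0.974, 0.986, -0.449],
    [0.957, 0.944, 1.000, 0.703, 0.931, 0.907, -0.502],
    [0.807, 0.828, 0.703, 1.000, 0.857, 0.862, -0.021],
    [0.961, 0.974, 0.931, 0.857, 1.000, 0.984, -0.371],
    [0.973, 0.986, 0.907, 0.862, 0.984, 1.000, -0.406],
    [-0.468, -0.449, -0.502, -0.021, -0.371, -0.406, 1.000],
]

THRESHOLD = 0.5  # assumption inferred from the alerts, not from the report

# For each column, list the other columns whose |r| meets the threshold.
partners = {
    name: [
        columns[j]
        for j, r in enumerate(synthetic_corr[i])
        if j != i and abs(r) >= THRESHOLD
    ]
    for i, name in enumerate(columns)
}

for name, others in partners.items():
    print(f"{name}: {len(others)} highly correlated field(s)")
```

Under this assumption new_vaccinations_smoothed pairs only with mis_and_disinformation_female (−0.502), and the per-column counts agree with the synthetic High Correlation alerts.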
[Figure: a simple visualization of nullity by column, shown for Original Data and Synthetic Data.]
[Figure: nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion, shown for Original Data and Synthetic Data.]
First rows
Original Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN |
| 1 | 1 | 0 | 0 | 1 | 0 | 1 | NaN |
| 2 | 2 | 0 | 0 | 0 | 0 | 0 | NaN |
| 3 | 2 | 0 | 0 | 0 | 0 | 0 | NaN |
| 4 | 0 | 0 | 0 | 1 | 0 | 0 | NaN |
| 5 | 1 | 0 | 0 | 1 | 0 | 0 | NaN |
| 6 | 1 | 0 | 1 | 0 | 0 | 0 | NaN |
| 7 | 0 | 0 | 0 | 0 | 0 | 0 | NaN |
| 8 | 1 | 1 | 0 | 0 | 0 | 0 | NaN |
| 9 | 0 | 0 | 0 | 0 | 0 | 0 | NaN |
Synthetic Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| 0 | 0.881926 | 0.007294 | 0.000096 | 1.441762e-11 | 0.002502 | 0.003399 | 433.755737 |
| 1 | 0.561649 | 0.004337 | 0.000046 | 4.396738e-12 | 0.001904 | 0.002689 | 206.921494 |
| 2 | 0.799067 | 0.006421 | 0.000049 | 5.791453e-12 | 0.002390 | 0.003687 | 171.967133 |
| 3 | 27.981241 | 0.831350 | 0.056760 | 9.115203e-11 | 0.123705 | 0.081263 | 270.183807 |
| 4 | 3417.765625 | 828.431335 | 24.043804 | 2.883288e-07 | 11.475373 | 39.915588 | 23.367708 |
| 5 | 1.287363 | 0.015360 | 0.000077 | 4.580738e-11 | 0.003487 | 0.004829 | 806.266113 |
| 6 | 2.275546 | 0.031279 | 0.000325 | 1.437807e-11 | 0.010065 | 0.014703 | 222.214279 |
| 7 | 0.476046 | 0.003918 | 0.000020 | 6.979158e-11 | 0.005062 | 0.006368 | 1806.305420 |
| 8 | 2.384856 | 0.027774 | 0.001296 | 6.562452e-12 | 0.009462 | 0.008861 | 231.261658 |
| 9 | 0.406959 | 0.002802 | 0.000010 | 1.710320e-10 | 0.003730 | 0.004456 | 3761.653076 |
Last rows
Original Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| 22610 | 2 | 0 | 0 | 0 | 0 | 0 | NaN |
| 22611 | 1 | 0 | 0 | 1 | 0 | 1 | NaN |
| 22612 | 0 | 0 | 0 | 0 | 0 | 0 | NaN |
| 22613 | 32 | 19 | 8 | 1 | 0 | 1 | NaN |
| 22614 | 39 | 15 | 5 | 9 | 2 | 6 | NaN |
| 22615 | 61 | 40 | 10 | 7 | 1 | 4 | NaN |
| 22616 | 42 | 20 | 7 | 9 | 2 | 1 | NaN |
| 22617 | 40 | 25 | 3 | 16 | 2 | 8 | NaN |
| 22618 | 43 | 22 | 6 | 5 | 0 | 2 | NaN |
| 22619 | 55 | 28 | 14 | 11 | 1 | 5 | NaN |
Synthetic Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed |
|---|---|---|---|---|---|---|---|
| 20152 | 111.297600 | 6.063241 | 0.101819 | 2.119309e-09 | 0.239296 | 0.670247 | 29.048500 |
| 20153 | 8.541916 | 0.178145 | 0.001435 | 6.160439e-10 | 0.040290 | 0.054823 | 410.363800 |
| 20154 | 0.528851 | 0.006146 | 0.000017 | 1.129179e-10 | 0.003515 | 0.005103 | 8263.362305 |
| 20155 | 0.960515 | 0.012357 | 0.000036 | 3.918914e-10 | 0.004379 | 0.005664 | 5405.026367 |
| 20156 | 1.010556 | 0.010243 | 0.000061 | 4.024147e-11 | 0.003856 | 0.005503 | 400.746857 |
| 20157 | 0.782099 | 0.008983 | 0.000158 | 5.700643e-12 | 0.004049 | 0.004647 | 341.378448 |
| 20158 | 1.091776 | 0.013979 | 0.000135 | 6.930960e-12 | 0.005139 | 0.007120 | 221.266937 |
| 20159 | 0.532489 | 0.006224 | 0.000601 | 1.007859e-10 | 0.017582 | 0.004054 | 7451.645996 |
| 20160 | 340.912537 | 35.149246 | 0.402780 | 6.029190e-09 | 1.361110 | 2.381244 | 10.289628 |
| 20161 | 5.310801 | 0.051027 | 0.001397 | 4.791975e-10 | 0.013709 | 0.016100 | 3292.263916 |
Duplicate rows
Original Data
| | mis_and_disinformation | mis_and_disinformation_male | mis_and_disinformation_female | myths | myths_female | myths_male | new_vaccinations_smoothed | # duplicates |
|---|---|---|---|---|---|---|---|---|
| 178 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | 391 |
| 259 | 1 | 0 | 0 | 0 | 0 | 0 | NaN | 121 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 95 |
| 194 | 0 | 0 | 0 | 1 | 0 | 0 | NaN | 72 |
| 302 | 1 | 1 | 0 | 0 | 0 | 0 | NaN | 58 |
| 333 | 2 | 0 | 0 | 0 | 0 | 0 | NaN | 57 |
| 358 | 2 | 1 | 0 | 0 | 0 | 0 | NaN | 36 |
| 283 | 1 | 0 | 1 | 0 | 0 | 0 | NaN | 30 |
| 266 | 1 | 0 | 0 | 1 | 0 | 0 | NaN | 27 |
| 202 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | 21 |
Synthetic Data
Dataset does not contain duplicate rows.